An Automatic Lip-reading Method Based on Polynomial Fitting
نویسنده
چکیده
This paper addresses the problem of speaker-dependent isolate digits recognition using sole visual information. We employ intensity transformation and spatial filter to estimate the minimum enclosing rectangle of mouth in each frame. Thus, for each utterance, we can obtain two vectors composed of width and height of mouth, respectively. Then, we propose an approach to recognize the speech based on polynomial fitting. Firstly, both width and height vectors are normalized into the constant length via interpolation. Secondly, least square method is utilized to produce two 3order polynomials that can represent the main trend of the two vectors, respectively, and reduce the noise caused by the estimate error. Lastly, positions of three crucial points (i.e. maximum, minimum and right boundary point) in each 3-order polynomial curve are recorded as a feature vector. For each utterance, we calculate the average of all vectors of training sample to make a template, and using Euclidean distance between the template and testing data to perform the classification. Experiments show the promising results of the proposed approach.
منابع مشابه
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملAutomatic Hybrid Approach for Lip POI Localization: Application for Lip-reading System
Automatic Lip-reading system is one of the different assistive technologies for hearing impaired or elderly people. We can imagine, for example, a dependent person ordering a machine with an easy lip movement or by a simple visemes (visual phoneme) pronunciation. The need for an automatic lip-reading system is ever increasing. The lip-reading system is decomposed in three subsystems, first we h...
متن کاملAutomatic Lip Reading for Daily Indonesian Words Based on Frame Difference and Horizontal-vertical Image Projection
Automatic lip reading is one of research being developed lately. Automatic lip reading has been used for various purposes, such as enhancing speech recognition and aid to speech training for the deaf. There are two approaches in lip feature extraction, namely appearance based and shape based. Appearance based approach is usually better, because it provides visual features that cover not only li...
متن کاملLip Tracking Towards an Automatic Lip Reading Approach
Current era is to make the interaction between humans and their artificial partners (Computers) and make communication easier and more reliable. One of the actual tasks is the use of vocal interaction. Speech recognition may be improved by visual information of human face. In literature, the lip shape and its movement are referred to as lip reading. Lip reading computing plays a vital role in a...
متن کاملLip Contour Detection Techniques Based on Front View of Face
Lip contour detection and tracking is the most important pre-requisite for computerized speech reading. Several approaches have been proposed for lip tracking after lip contour is accurately initialized on first frame. Detection and tracking of the lip contour is an issue in speech reading. A relatively large class of lip reading algorithms are available based on lip contour analysis. In these ...
متن کامل